An Arbitrary Gini Index for the Redundant Feature Datasets Analysis
نویسندگان
چکیده
منابع مشابه
An elementary characterization of the Gini index
The Gini index is one of the most used indicators of social and economic inequality. In this paper we characterize the Gini index as the unique function that satis es the properties of scale independence, symmetry, standardization and separability. Furthermore, we propose a simpler way to compute it. Keywords: Gini index; income inequality; axiomatization. JEL Classi cation: D31, D63, I31. 1 In...
متن کاملFast SFFS-Based Algorithm for Feature Selection in Biomedical Datasets
Biomedical datasets usually include a large number of features relative to the number of samples. However, some data dimensions may be less relevant or even irrelevant to the output class. Selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. To this end, this paper presents a hybrid method of filter and wr...
متن کاملAn (almost) unbiased estimator for the S-Gini index∗
This note provides an unbiased estimator for the absolute S-Gini and an almost unbiased estimator for the relative S-Gini for integer parameter values. Simulations indicate that these estimators perform considerably better then the usual estimators, especially for small sample sizes. 1 The absolute and relative S-gini indices Assume that income is distributed according to a continuous and diffe...
متن کاملHow Redundant Is It? - An Empirical Analysis on Linked Datasets
Data redundancy resides in most, if not all, information systems. Linked Data is no exception. Existing approaches try to avoid data redundancies by proposing compression techniques or succinct data structures. However, data redundancies in Linked Data are useful sometimes, e.g., ontology based data access can make use of A-Box redundancies to avoid unnecessary query rewritings. Either you want...
متن کاملThe Gini Index of Speech
In which representation is speech most sparse? Time-scale? Time-frequency? Which window generator and length should be used to create the sparsest decomposition? To answer these questions, we propose the Gini index, which is twice the area between the Lorenz curve and the 45 degree line, as a measure of signal sparsity. The Gini index, introduced in 1912, is one of the most common measures of i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Indian Journal of Science and Technology
سال: 2017
ISSN: 0974-5645,0974-6846
DOI: 10.17485/ijst/2017/v10i4/110665